Mirror Worlds for Robots: Fricial, Artifriction, and Resonial in Low-Dimensional Interaction Projections

Physical Intelligence Research • Garlileo Lab

In traditional robotics and world model research, the default assumption is that robots need a complete 3D understanding of the environment. Cameras, LiDAR, and depth sensors are used to generate detailed 3D scene graphs to support tasks like grasping, navigation, and interaction. However, the complexity of real-world environments creates enormous computational and data burdens: the combinatorial possibilities of object geometry, materials, lighting, and environmental dynamics lead to exponentially large state spaces.

Mirror Worlds: Compressing 3D into Behavior-Relevant 2D

Instead of reconstructing full 3D scenes, we propose providing robots with a mirror-like 2D world — a compressed, task-relevant representation that preserves only the information critical for decision-making. Key features include:

Dimensionality reduction: High-dimensional visual input is projected into a 2D plane or a small set of feature channels.
Interaction-focused: Only properties directly relevant to actions are retained, such as contact points, friction coefficients, and support forces.
Task-driven: Irrelevant geometric details are discarded for tasks like grasping, locomotion, or collision avoidance.

This mirrors human vision: the eyes capture a 2D projection, and the brain infers depth, shape, and dynamics. Similarly, a robot can rely on Fricial projections — representations of friction and interaction constraints — to replace full 3D reconstruction.

Integrating Fricial, Artifriction, and Resonial

In this mirror world:

Fricial encodes the physical interaction constraints themselves, such as contact forces, resistance, and local friction at points of interaction.
Artifriction represents the robot's learned understanding of these interactions, allowing AI to infer object behaviors, grip stability, and motion outcomes.
Resonial provides a coordination layer for multi-agent or multi-object systems, maintaining coherent timing, phase relationships, and behavioral stability within the mirrored projection.

The combination ensures that the robot not only perceives the physics but also models how agents and objects behave and interact over time.

Technical Implementation

Visual Encoder
Transform camera input into a feature map that emphasizes task-relevant regions and interaction cues.
Fricial Projection Layer
Map features into a 2D Fricial space capturing contact points, friction coefficients, and support surfaces for each object.
Artifriction Module
Use learned models to estimate likely outcomes of interactions within the Fricial space, predicting grip success, slip probabilities, and dynamic behavior.
Resonial Coordination Layer
Maintain phase alignment and temporal consistency between multiple agents or objects, ensuring coherent multi-agent dynamics in the mirror world.
Low-Dimensional Policy Execution
Reinforcement learning or planning occurs in the mirror world, guiding real-world actions efficiently.
Feedback Correction
Sensor feedback (force/torque, tactile, or visual) updates Fricial and Artifriction representations, closing the loop.

Advantages

Computational efficiency: Avoids building a full 3D model, reducing state space complexity.
Data efficiency: Fewer training samples are needed compared to full 3D reconstruction.
Explainability: Interaction variables are explicit and interpretable.
Sim-to-real transfer: Policies learned in the mirror world generalize to the real world through accurate Fricial and Artifriction mapping.

Conclusion

Mirror worlds challenge the conventional approach of full 3D reconstruction. By integrating Fricial, Artifriction, and Resonial, robots operate in a compressed, interaction-focused projection space. This framework emphasizes not the shape of objects, but how they behave and interact, providing a scalable and interpretable pathway for advanced world models. Future systems may combine these layers to support complex multi-agent coordination, efficient planning, and real-world adaptability, bridging the gap between perception and action in digital and physical environments.